Data Mining; A Conceptual Overview

نویسنده

  • Joyce Jackson
چکیده

This tutorial provides an overview of the data mining process. The tutorial also provides a basic understanding of how to plan, evaluate and successfully refine a data mining project, particularly in terms of model building and model evaluation. Methodological considerations are discussed and illustrated. After explaining the nature of data mining and its importance in business, the tutorial describes the underlying machine learning and statistical techniques involved. It describes the CRISP-DM standard now being used in industry as the standard for a technology-neutral data mining process model. The paper concludes with a major illustration of the data mining process methodology and the unsolved problems that offer opportunities for research. The approach is both practical and conceptually sound in order to be useful to both academics and practitioners.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Ancient Gold Mining Activities in India - An Overview

Gold was obtained through washing or panning of the river sands during initial periods of civilisation. With the advent of knowledge of metallurgical processing of ores it was recovered through mining of in-situ quartz reefs, and then from auriferous sulphide ores. The metal mining activities are evidenced in the form of large number of ‘ancient metal mines’ or ‘old workings’ and ‘placer mining...

متن کامل

Mining at Detail Level Using Conceptual Graphs *

Text mining is defined as knowledge discovery in large text collections. It detects interesting patterns such as clusters, associations, deviations, similarities, and differences in sets of texts. Current text mining methods use simplistic representations of text contents, such as keyword vectors, which imply serious limitations on the kind and meaningfulness of possible discoveries. We show ho...

متن کامل

Conceptual Knowledge Processing with Google

This paper introduces a tool for Conceptual Knowledge Processing with Google. The featured prototype, called FooCA, tries to combine the advantages of two research disciplines, Web Mining and Formal Concept Analysis (FCA). Web Mining techniques are used to preprocess search results retrieved via Google, presenting the formal context in an interactive cross table. A new formal context can be ite...

متن کامل

Comparison and evaluation of source code

Program source code substantially is structured and contains semantically rich programming constructs such as 6 variables, functions, data structures, and program structures which indicate patterns. Mining source code by using different data 7 mining techniques to extract the valuable hidden patterns is the new revolution in software engineering. Over last decade many 8 tools and techniques hav...

متن کامل

A Case for Supplementing Evidence Base Medicine with Inductive Clinical Knowledge: Towards a Technology-Enriched Integrated Clinical Evidence System

Clinical evidence exist in modalities other than published clinical literature, such as clinical data ranging from patient clinical profiles to clinical trials; clinical experiences of eminent medical practitioners; and medical knowledge bases encapsulating knowledge about patient care, healthcare guidelines and protocols, clinical workflow and so on. We propose a technology-enriched strategy t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CAIS

دوره 8  شماره 

صفحات  -

تاریخ انتشار 2002